Systems Biological Approach of Molecular Descriptors Connectivity: Optimal Descriptors for Oral Bioavailability Prediction
نویسندگان
چکیده
BACKGROUND Poor oral bioavailability is an important parameter accounting for the failure of the drug candidates. Approximately, 50% of developing drugs fail because of unfavorable oral bioavailability. In silico prediction of oral bioavailability (%F) based on physiochemical properties are highly needed. Although many computational models have been developed to predict oral bioavailability, their accuracy remains low with a significant number of false positives. In this study, we present an oral bioavailability model based on systems biological approach, using a machine learning algorithm coupled with an optimal discriminative set of physiochemical properties. RESULTS The models were developed based on computationally derived 247 physicochemical descriptors from 2279 molecules, among which 969, 605 and 705 molecules were corresponds to oral bioavailability, intestinal absorption (HIA) and caco-2 permeability data set, respectively. The partial least squares discriminate analysis showed 49 descriptors of HIA and 50 descriptors of caco-2 are the major contributing descriptors in classifying into groups. Of these descriptors, 47 descriptors were commonly associated to HIA and caco-2, which suggests to play a vital role in classifying oral bioavailability. To determine the best machine learning algorithm, 21 classifiers were compared using a bioavailability data set of 969 molecules with 47 descriptors. Each molecule in the data set was represented by a set of 47 physiochemical properties with the functional relevance labeled as (+bioavailability/-bioavailability) to indicate good-bioavailability/poor-bioavailability molecules. The best-performing algorithm was the logistic algorithm. The correlation based feature selection (CFS) algorithm was implemented, which confirms that these 47 descriptors are the fundamental descriptors for oral bioavailability prediction. CONCLUSION The logistic algorithm with 47 selected descriptors correctly predicted the oral bioavailability, with a predictive accuracy of more than 71%. Overall, the method captures the fundamental molecular descriptors, that can be used as an entity to facilitate prediction of oral bioavailability.
منابع مشابه
In-silico prediction of Cellular Responses to Polymeric Biomaterials from Their Molecular Descriptors
In this work quantitative structure activity relationship (QSAR) methodology was applied for modeling and prediction of cellular response to polymers that have been designed for tissue engineering. After calculation and screening of molecular descriptors, linear and nonlinear models were developed by using multiple linear regressions (MLR) and artificial neural network (ANN) methods. The root m...
متن کاملA Priori Prediction of Tissue: Plasma Partition Coefficients (Log BP) of Drugs to Facilitate the Use of MLR and MLR-GA Methods
It is important to determine whether a candidate molecule is capable of penetrating the plasma-brain barrier indrug discovery and development. The aim of this paper is to establish a predictive model for plasma-brainbarrier penetration using simple descriptors The usefulness of the quantum chemical descriptors, calculated atthe level of the DFT and HE theories using 6-310* basis set for QSAR st...
متن کاملNovel Atom-Type-Based Topological Descriptors for Simultaneous Prediction of Gas Chromatographic Retention Indices of Saturated Alcohols on Different Stationary Phases
In this work, novel atom-type-based topological indices, named AT indices, were presented as descriptors to encode structural information of a molecule at the atomic level. The descriptors were successfully used for simultaneous quantitative structure-retention relationship (QSRR) modeling of saturated alcohols on different stationary phases (SE-30, OV-3, OV-7, OV-11, OV-17 and OV-25). At first...
متن کاملA QSAR Study of HIV Protease Inhibitors Using Computational Descriptors to Prediction of pki of Cycle Derivatives of Urea
Preventing and reducing the spread of HIV (HIV) has always been a concern in medical science. One of the most common ways to control the virus is using enzyme-blocking drugs. In this study, we attempted to predict the biological activity (PKi) of organic urea derivatives in protease inhibitor compounds using molecular modeling using QSAR (Quantitative Structure Activity Relation), which is the ...
متن کاملHuman Oral Bioavailability Prediction of Four Kinds of Drugs
In the development of drugs intended for oral use, good drug absorption and appropriate drug delivery are very important. Now the predictions for drug absorption and oral bioavailability follow similar approach: calculate molecular descriptors for molecules and build the prediction models. This approach works well for the prediction of compounds which cross a cell membrane from a region of high...
متن کامل